IN THE CLAIMS : 

The following is a complete list of the claims now pending; this listing replaces 
all earlier versions and listings of the claims. 



\ ^ (Currently Amended) A docxmient type definition generating method; 
comprising, in for generating a document type definition of a structured document provided with 
tcittg tags, each tag having an element name in for each document elemen t, said method 
comprising : 

a physical structure judging step^ of judging a similarity between 
physical st r ucture of ^ach structures of each of the document clement elements in the structured 
document : 

a semantic structure judging step^ of judging a similarity between 
semantic stmctur c of said e ach docum e nt clement structures of each of the tags by comparing the 
form of each tagged elenljent : and 

I document type definition generating step^ of gene r ating docum e nt 
typ e d e finition to define appearance state of th e document cl e ment in said structured docum e nt 
judging a similarity of the tags based on judgment results of said physical structure judging step 
and said semantic structure judging step , and generating the document type definition which 
unifies the element names of the similar tags . 

2. (Current W Amended) The A document type definition generating 
method according to claim 1, wherein said physical structure judging step comprises judging the 



physical structure of the document clement elements based on an mdcntion indentation or a blank 
line. \ 



3. (Currently Amended) ^¥hc A document type definition generating 
method according to claim 2, wherein^ when the physical structure of the document element is 
judged based on said indention tne indentation , the judging is performed by excluding the 
indention indentation which represents quotation. 

4. (Currently Akiended) The A document type definition generating 
method according to claim 2, wherein^ when the physical structure of the document element is 
judged based on said tiie blank line, the judging is performed by excluding the blank line from a 
the document in which a description is made\by constantly placing every predetermined number 
of blank lines. 



(Original) 



according to claim 1, wherein said\] 



The A document type definition generating method 

siical structure judging step comprises judging the physical 
structure of the document element B^se^ on a positional r elation relationship of the tags 
surrounding the document element. 



a semantic information database to judge the semantic structure of the document element based 
on connection of words and phrases connection in a the document and word types. 



7. (Original) The A document type definition generating method 
according to claim 1, wherein\aid semantic structure judging step comprises judging the 
semantic structure of the document element based on a meaning represented by the tags 
surrounding the document element. 



8. (Currently Amended) ftc A document type definition generating 
method according to claim 1, wherein said document type definition generating step comprises a 
redundancy removing step of, when the physical structure and the semantic structure of a 
plurality of document elements havingylfee tags different in element name are similar, regarding 
the document elements as being of the same type and excluding one element name from a 
document type definition generating object based on the judgment results of said physical 
structure judging step and said semantic structure judging step. 



9. (Currently Amended) The A document type definition generating 
method according to claim 8, wherein said redundancy removing step comprises obtaining 
similarity degrees concerning agreement degrees of the physical structure and the semantic 
structure between the document elements having mc tags different in element name, and 
regarding the document elements as being of the same type when a general similarity value 
calculated fi'om the similarity degrees is equal to or more than a predetermined threshold value. 




1 0. (Currently Amended) The A document type definition generating 

\ 

method according to claim 1, wherein said document type definition generating step comprises a 
title changing step of, when the physical structure and the semantic structure of a plurality of 
document elements having tite tags with the same element name are different, regarding the 
document elements as being of different types and changing one element name based on the 
judgment results of said physical structure judging step and said semantic structure judging step. 



1 (Original) The A document type definition generating method 
according to claim*!, whb:ein said document type definition generating step comprises analyzing 
words and phrases preset between a start tag and an end tag having the same title, obtaining 
information to be includedj^ety^een the tags, and generating the document type definition based 
on the information. 



(Currently Amended) A docxmient type definition generating apparatus 
comprising: in for generatingNa document type definition of a structured document provided v^th 
a-tag tags, each tag having an ment name in for each document element, said apparatus 
comprising: 

physical stricture judging means for judging a similarity between 
physical structure of said each structures of each of the document c l e m e nt elements in the 
structured document : 
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semantic structure judging means forjudging a similarity between 
semantic structure of said each document clem e nt structures of each of the tags bv comparing the 



form of each tagged element : and 

document type definition generating means for gene r ating docum e nt 



type definition to define appcai - ancc 



state of the document clement in said stmcturcd docum e nt 



judging a similarity of the tags based on judgment results of said physical structure judging 
means and said semantic structure judging means , and generating the document type definition 
which unifies the element names of the similar tags . 



1 3 . (Currently [Amended) The A document type definition generating 
whferein said physical structure judging means judges the 

c ment elements based on an indention indentation or a blank 



apparatus according to claim 12, 
physical structure of the document c: 
line. 



14. (Currently 
apparatus according to claim 13, whdrein 
physical structure of the document e 
the indention indentation which 



i tended) The A docximent type definition generating 
said physical structure judging means judges the 
jment based on said indention the indentation by excluding 



represents quotation. 



15. (Currently 
apparatus according to claim 13, whejrein 
physical structure of the document element 



Amended) The A document type definition generating 
said physical structure judging means judges the 
based on said the blank lines by excluding the blank 



lines from a document in i^hich description is made by constantly placing every predetermined 
number of blank lines. 



16. 

according to claim 12| 
structure of the doc 
document element. 




(Original) The A document type definition generating apparatus 
in said physical structure judging means judges the physical 
ement based on a positional relation of the tags surrounding the 
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\ \ ^ ^- (Currently Amended) The A document type definition generating 
apparatus according, to claim 12, wherein said semantic structure judging means refers to a 
semantic informationnatabase to judge the semantic structure of the document element based on 
connection of words and phrases comi c ction in a Ae document and word types. 

1 8. (Original) The A document type definition generating apparatus 
according to claim 12, wherem said semantic structure judging means judges the semantic 
structure of the document elemljnt based on a meaning represented by the tags surrounding the 
document element. 



1 9. (Currently ^^ended) The A document type definition generating 
apparatus according to claim 12, whereinSsaid document type definition generating means 
comprises redundancy removing means for, Vhen the physical structure and the semantic 
structure of a plurality of document elements having the tags different in element name are 



similar, regarding the document elements as being of the same type and excluding one element 
name from a document type definition generating object based on the judgment results of said 
physical structure judging means and said semantic structure judging means. 



20. (Currently Amended) The A document type definition generating 
apparatus according to claim 11 9, wherein said redundancy removing means obtains similarity 
degrees concerning agreement degrees of the physical structure and the semantic structure 
between the document elements having titc tags different in element name, and regards the 
document elements as being ob the same type when a general similarity value calculated from the 
similarity degrees is equal to orlmore than a predetermined threshold value. 

2 1 . (Currently Amended) The A document type definition generating 
apparatus according to claim 12, wherein said document type definition generating means 
comprises title changing means for, when the physical structure and the semantic structure of a 
plurality of document elements having the tags with the same element name are different, 
regarding the document elements as being of different types and changing one element name 
based on the judgment results of saio^ physical structure judging means and said semantic 
structure judging means. 



22. (Original) The A document type definition generating apparatus 
according to claim 12, wherein saidldocument type definition generating means analyzes words 
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and phrase^ present between a start tag and an end tag having the same title, obtains information 
to be 

included berA^en the tags, and generates the document type definition based on the information. 




^^|^\ I 2!^. (Currently Amended) A computer-readable storage medium storing a 
document ty^g dcfini^on generating program for controlling a computer to pe r fo r m execute a 
document type definition generation method for generating document type definition of a 
structured document prowded with tags, each tag having an element name for each document 
element, said program comprising cod e s for causing the comput e r to p e rform : 

ima sti"uctui 'e d document p r ovid e d with a tag having an elem e nt nam e 
in e ach docum e nt clem e nt, code for a physical structure judging step^ of judging a similarity 
between physical structure of each structures of each of the document elem e nt elements in the 
structured document : \ 

code for a semantic structure judging step^ of judging a similarity 
between semantic structure of said each document element structures of each of the tags bv 
comparing the form of each tagged\element : and 

code for a document type definition generating step^ of generating 
document ty p e d e finition to d e fine ap p eai - anc c state of the document clem e nt in said stmctui ' cd 
document judging a similarity of the tagte based on judgment results of said physical structure 
judging step and said semantic structure judging step , and generating the document type 
definition which imifies the element names>pf the similar tags . 
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